Introduction

In this project we want to investigate the correlation between the diets and beak measurements of birds. We want to figure out if you can predict a bird diet based on their beak measurements alone.

The Bird Beak

A bird’s beak anatomy can vary greatly in size and shape depending on its habitat and diet. A bird uses its for more than just eating, they are also used to build nests and sometimes to fight of other creatures invading their territory.

Depending on the function a beak is required to carry out for a certain diet, the beak shape and size can vary greatly. e.g.

  • A cone shaped beak is used for picking up and cracking seeds easily.

  • Broader and flatter beaks are needed to create a larger surface area for bug catching in the air.

  • For bug feeding on the groud, beaks are more tweezer-like.

  • For meat eating, very strong beaks are required with the tip pointing downwards in order to hook onto the meat easier.

  • Long thing beaks with a spear-like tip are great for catching fish.

figure 1: Bird Beak Anatomy

figure 1: Bird Beak Anatomy

The bird beak is made up of the cere, nare, culmen, maxilla, mandible and gape fillange (see fig. 1). It’s useful to note that both the nare and culmen are lengths of the beak that can be measured easily. They differ only as the nare is from nostril to tip whereas the culmen starts from the skull.

The Diets of Birds

The different diet of birds can be sorted into different trophic level. A trophic level defines the hierarchy of an organism in a ecosystem. The main three trophic levels we will be looking at are:

  • Carnivore: A species whose diet consists of at least 70% of food from consuming live animals.

  • Herbivore: A species whose diet consists of at least 70% of food from consuming plants.

We decided to leave out scavenger birds since they make up less than 1% of our data and so using them may of led to false predictions.

We also left out omnivores, this was due to the overlapping diets that they have in correlation to Carnivores and Herbivores. An omnivore is a species whose diet consists of a mixed consumption of plants and animals at an approximately equal rate. This means that their beak measurements doesn’t have as definite relationships as the Carnivores and Herbivores, which may have also lead to false predictions.

The main trophic levels can be sub categorised into more niche trophics. Within these categories we found there to be a clearer link between the beak measurements and diets therefore we looked at the most common one, Invertivores to make our model. To be categorised as one the following trophic levels the species must get at least 60% of its diet from the corresponding food resource.

  • Frugivore: fruit.

  • Granivore: seeds or nuts.

  • Nectarivore: nectar.

  • Herbivore: plant materials in non-aquatic systems, including leaves, buds, whole flowers etc.

  • Herbivore aquatic: plant materials in aquatic systems, including algae and aquatic plant leaves.

  • Invertivore:invertebrates in terrestrial systems, including insects, worms, arachnids, etc.

  • Vertivore: vertebrate animals in terrestrial systems, including mammals, birds, reptiles etc.

  • Aquatic Predator: vertebrate and invertebrate animals in aquatic systems, including fish, crustacea, molluscs, etc.

  • Scavenger: carrion, offal or refuse.

  • Omnivore: Species using multiple niches, within or across trophic levels, in relatively equal proportions.

Our Data

Our main data set for this project is an AVONET Supplementary Data Set called “birds_data”. It contains different bird species measurements and behavior; such as their habitat, diet and migration patterns.

We will be focusing on the following variables of this data set that are useful in our project:

  • Species ID: Unique identifier for species (or species group) linking to the Avibase dataset which provide detailed information on taxonomy, synonymy and distribution for the world’s birds.

  • Total Individuals: Number of individual birds measured for each species.

  • Complete Measures: Number of measured birds with a complete set of morphometric trait measurements.

  • Beak Length: The nares length is measured from the front edge of the nostril to the tip of the beak (see fig 2). We’ve chosen to focus on nares length over culmen length (the length from the tip of the beak to the base of the skull) as beak width and depth also refer to the nostril location, whereas the culmen length refers to the skull.

  • Beak Width: Width of the beak at the front edge of the nostrils (see fig 2).

  • Beak Depth: Depth of the beak at the front edge of the nostrils (see fig 2).

figure 2: Bird Measurements Taken

figure 2: Bird Measurements Taken

  • Tarsus Length: Length from the notch between tibia and tarsus, to the bend of the foot (see fig 2).

  • Wing Length: Length from the bend of the wing to the tip of the longest primary feather (unflattened wing) (see fig 2).

  • Kipps Distance: Length from the tip of the first secondary feather to the tip of the longest primary feather (see fig 2).

  • Secondary Length: Length from the bend of the wing to the tip of the first secondary feather (see fig 2).

  • Tail Length: Distance between the tip of the longest tail feather and the point at which the tail protrudes from the skin (see fig 2).

  • General Trophic: Main diet level groups (Carnivore, Omnivore, Herbivore, Scavenger)

  • Niche Trophic: Niche diet level, sub categories of Tropic Level groups.

Methods

Data Tidying

In order to tidy our original data set we checked for official NAs and located them, selected our required columns to reduce the variable amount, then renamed and reordered the columns to keep them clean and making sense.

Data Science Techniques

During our project we have used a multitude of data science techniques. The main ones we have used are as follows:

  • We have tidied and wrangled the data in order to make it clear and concise when analysing.

  • We have imported the original data set and transcribed into csv format to make it easy for us to read and access.

  • For data visualisation we have created many graphs to demonstrate our findings and data.

  • For our model, we are using logistic regression as we are predicting a discrete value.

Results

The Investigation Process

General Trophic Level

When investigating the beak data, there are three variables to consider; beak length (nares), beak width and beak depth. Summarising the data from these 3 columns we get:

##  Beak_Nares_Length   Beak_Width       Beak_Depth    
##  Min.   :  1.60    Min.   : 0.700   Min.   :  1.00  
##  1st Qu.:  8.50    1st Qu.: 3.600   1st Qu.:  3.80  
##  Median : 11.70    Median : 5.000   Median :  5.80  
##  Mean   : 17.06    Mean   : 6.579   Mean   :  8.06  
##  3rd Qu.: 18.00    3rd Qu.: 7.700   3rd Qu.:  9.40  
##  Max.   :389.80    Max.   :88.900   Max.   :110.90

We can visualise these values in a box plot whilst separating them into their respective General trophic levels:

Discounting the limited data for scavenger birds, these box plots show beak length doesn’t differ greatly between trophic levels on average. We can see that Herbivores on average have the widest and deepest beaks but are slightly shorter than the other trophics.

The trophic levels don’t have equal amounts of data, generally the carnavores have many more entries compared to any other column. We don’t believe this is due to sampling bias, just due to the fact there are likely more carnivore species of bird in existence compared to others.

Niche Trophic Level

As from above you can see that there wasn’t a clear disntiction of the bird measurements between different General Trophic Levels. So we decided to have a look at the Niche Trophic Levels.

talk about the different means medians etc idk.

heres a graph just showing the means

see about the mean etc

Modelling the Data

In order to answer our question of is it possible to predict diet based on beak measurements, we will need to use a logistic regression model since this is a discrete variable case.

For our model we will be predicting whether or not a species of bird is a certain trophic level based on its predicting factors of beak length, beak depth and beak width.

Initially, we wanted to try and see whether one predictor (just beak depth) would be a better predictor model than the three predictors, length, width and depth.

Our first model, model-1, uses only beak depth as a predictor value to predict whether the bird is an Invertivore or not. We then made a second model for Invertivores that uses all 3 predictor values (beak length, width and depth).

Comparing model-1 to model-c we first look at their equations:

  • Model 1: predicted carnivore true = 0.476 - 0.033 * Beak_Depth

  • Model c: predicted carnivore true = 0.410 + 0.030 * Beak_Nares_Length - 0.002 * Beak_Width - 0.086 * Beak_Depth

stuff about what this means :)

The ROC of models 1 and c are below:

Comparing model-c to model-h and to model-o we first look at their equations:

  • Model c: predicted carnivore true = 0.410 + 0.030 * Beak_Nares_Length - 0.002 * Beak_Width - 0.086 * Beak_Depth

  • Model h: predicted carnivore true = -1.185 - 0.039 * Beak_Nares_Length - 0.002 * Beak_Width + 0.106 * Beak_Depth

  • Model o: predicted carnivore true = -1.523 - 0.002 * Beak_Nares_Length + 0.0019 * Beak_Width - 0.023 * Beak_Depth

stuff about what this means :)

The ROC of models c, h and o are below:

Discussion – Answers your research question(s). What are the advantages/limitations of the investigation you have performed? If applicable, are there any ethical considerations in relation to your investigation/findings?

references: - figure 1: https://www.vecteezy.com/png/25217819-watercolor-painted-beautiful-bird-hand-drawn-colorful-bird